INTRODUCTION


Overall  Goals 


Although  the  storage  phosphor  imaging  technique  has  become  a  promising 
candidate  for  large  scale  implementation  of  digital  radiography  in  medium  resolution 
applications  such  as  chest  radiography  and  abdominal  imaging *'7,  it  does  not  have 
sufficient  spatial  resolution  capability  for  mammographic  imaging  applications^, 93 
This  is  partially  due  to  the  limitation  of  the  screen  optics^-lO.  A  more  important  cause, 
which  can  be  resolved,  is  the  use  of  large  laser  beam  spot  size  with  most  commercial 
storage  phosphor  image  reader^' 13. 

In  this  project,  we  propose  to  modify  and  optimize  an  experimental  CR  system  for 
high  resolution  mammographic  imaging  applications.  Image  properties  of  the  optimized 
system  will  be  characterized  by  measuring  the  signal  response,  MTF  and  noise  properties. 
A  comprehensive  comparison  study  based  on  phantoms  will  be  conducted  to  evaluate  and 
compare  the  CR  technique  with  the  conventional  screen-film  technique  for  detection  of 
microcalcifications as well as low contrast soft tissue masses. The possibility of patient dose
reduction  will  also  be  investigated.  In  an  addendum  to  our  proposal  dated  October  26, 
1992,  we  also  proposed  to  perform  preliminary  investigations  on  data  compression  of 
mammographic  or  similar  types  of  images. 

Main  Tasks 


The  project  consists  of  four  major  components:  (1)  system  modification  and 
optimization,  (2)  system  characterization  and  quality  control,  (3)  comparison  study  and 
(4) image compression study. To date, Tasks 1, 2 and 4 have been largely completed
although  quality  control  of  the  system  is  an  ongoing  endeavor.  Task  3  is  about  50% 
completed.  Specific  tasks  completed  in  the  first  and  second  years  are  summarized  as 
follows: 

In the first year of the grant period, we began and completed the following tasks:

1. We modified the laser optics and reduced the beam spot size from 120 µm to 50 µm.
We  measured  and  confirmed  the  reduction  of  spot  size. 

2.  We  measured  and  compared  the  resolution  capability  of  the  modified  imaging  system 
with  that  of  the  unmodified  system. 

3. We investigated methods of constructing a suitable phantom for the later
comparison studies.

4.  We  implemented  and  tested  several  image  compression  schemes  and  applied  them  to 
digitized  mammograms. 

In the second year of the grant period, we have accomplished the following tasks:

1. We have improved the spatial resolution capability of our storage phosphor image
reader by further reducing the laser spot size from 50 µm (obtained in the first year) to
40 µm.

2.  We  have  measured  and  compared  the  MTFs  of  the  improved  system  with  those  of 
commercial  systems. 

3.  We  have  developed  a  phantom  for  monitoring  the  overall  image  quality. 

4.  We  have  constructed  and  tested  phantoms  for  the  comparison  study. 

5.  We  have  begun  acquisition  of  storage  phosphor  and  film  images  for  the  ROC  study. 

6.  We  have  completed  the  test  of  the  compression  algorithms  developed  in  the  previous 
year. 

Methodology and results for the second year tasks will be described and discussed
in the following sections.

EXPERIMENTAL METHODS


1.  Further  System  Improvement 

We  have  continued  to  improve  our  experimental  storage  phosphor  imaging  system 
(ESPIS)  for  better  image  quality  and  for  more  convenient  operation.  For  better  spatial 
resolution properties, we have further reduced the laser spot size from 50 µm to 40 µm
(full width at half maximum). We have also acquired a number of high resolution storage
phosphor screens for use in image acquisition for the ROC study. These screens include
the newest one from Fuji, the HR-V. Our tests show that the HR-V offers the same
spatial resolution but a higher light gain.

We have also developed various techniques for measuring the parameters for
system operation and performance. For instance, we have improved our technique for
measuring the laser beam intensity profile so that it can be used on a real-time basis
during the focusing procedure. This involved the development of system software for
digitizing, scaling and displaying the laser beam intensity signal at a much faster rate (1
cycle every several seconds). This greatly improves the ease and accuracy of the focusing
procedure. As a result, we have been able to reduce the laser spot size from 50 µm to
40 µm. Further reduction is possible. However, it may start to degrade the sampling
process and generate gaps where x-ray information would be lost, and image noise may
increase. Although we continue to investigate the pros and cons of a smaller spot size, we
have concluded that, based on practical considerations, a 40 µm laser beam spot is a good
compromise for use in our comparison study.

We have also updated the acousto-optic modulator (AOM) to allow more
reliable  and  less  noisy  operation.  A  beam  intensity  monitor  and  feedback  loop  were 
added  to  minimize  the  fluctuation  of  laser  beam  power.  We  have  also  developed  a  series 
of  techniques  for  testing  the  operation  of  ESPIS.  One  of  these  techniques  is  direct 
measurement  of  the  scanning  laser  intensity  pattern.  We  have  been  able  to  use  film  to 
monitor  and  measure  the  spatial  intensity  distribution  of  the  scanning  laser  beam.  This 
technique  has  proved  to  be  useful  in  checking  the  stability  of  the  laser  power  or  the  proper 
function  of  the  AOM. 


2.  Measurement  of  Image  Quality 

We  have  concentrated  on  the  modulation  transfer  function  (MTF)  and  detective 
quantum  efficiency  (DQE)  for  measurement  of  the  image  quality. 

Spatial  Resolution 

We  continued  to  use  a  lead  bar  resolution  pattern  (Nuclear  Associates  07-553)  to 
monitor  and  measure  the  spatial  resolution  quality  of  the  ESPIS.  It  consists  of  22  groups 
of bar patterns with frequencies ranging from 0.25 to 10 lps/mm. Each group has a 4
cycle  bar  pattern.  A  specification  list  for  the  resolution  bar  pattern  is  attached  as 
Appendix  II. 

The  bar  pattern  images  are  visually  examined  to  evaluate  the  spatial  resolution 
limit  of  the  imaging  system.  A  fully  resolved  pattern  (with  good  contrast)  generally 
indicates that the MTF is around 5% or greater at that particular frequency. A partially
or barely resolved pattern with faint contrast indicates that the MTF is probably between
1% and 5%. Unresolved bar patterns generally indicate that the MTF is below 1%. This
observation may vary from person to person but can be used as a crude guideline when
visually examining the bar pattern images.

The  methods  for  measuring  the  spatial  resolution  properties  of  a  storage  phosphor 
imaging system have been previously discussed and demonstrated^, 12, 13. Our
methods focus on the use of a standard resolution test pattern as described before. The
square wave response function (SWRF) can be measured by plotting the signal profile
across the bar patterns and then measuring the average amplitude for each frequency.
Because the square waves contain high order harmonics, the SWRF cannot be directly
used to compute the MTF without correction for these harmonics. We have devised and
tested  a  signal  processing  method  to  compute  the  MTF  from  the  signal  profile  of  a  bar 
pattern. 


The signal profile was first segmented into sections of the same frequency.
Sections from a number of lines (20-70) were concatenated into one large array containing
many more cycles. The array was then Fourier transformed and the frequency
distribution of the signal was computed. This frequency distribution typically has a
fundamental peak followed by a number of higher order harmonics. These harmonics
generally decrease in magnitude as the order increases. To determine the MTF
value, the height of the fundamental peak was measured and normalized by dividing it by
the zero frequency component (the mean of the data). This normalization procedure is
crucial for compensating for non-uniform x-ray intensity or signal gain across the bar
pattern.
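
The Fourier-based procedure can be sketched as follows. This is a minimal illustration in Python with NumPy, not the software used in this project; the function name, the parameter names, the 20% search window and the synthetic profile are all assumptions made for the example. The function returns the fundamental amplitude divided by the zero frequency term. Converting that ratio into an MTF value would additionally require dividing by the same ratio for the input pattern (for an ideal square wave, 4/pi times the input modulation), a step not spelled out in the text.

```python
import numpy as np


def fundamental_modulation(profile, pixel_mm, bar_freq, search=0.2):
    """Fundamental amplitude of a bar-pattern profile divided by its mean.

    profile  : 1-D signal built by concatenating same-frequency sections
    pixel_mm : sampling interval in mm (hypothetical parameter name)
    bar_freq : nominal bar frequency in cycles/mm
    search   : fractional half-width of the window searched for the peak
    """
    profile = np.asarray(profile, dtype=float)
    n = profile.size
    amplitude = np.abs(np.fft.rfft(profile)) / n
    freqs = np.fft.rfftfreq(n, d=pixel_mm)

    dc = amplitude[0]  # zero-frequency component, i.e. the mean of the data
    window = np.abs(freqs - bar_freq) <= search * bar_freq
    window[0] = False  # never pick the zero-frequency bin
    # The factor 2 converts the one-sided spectrum to a sinusoid amplitude.
    fundamental = 2.0 * amplitude[window].max()
    return fundamental / dc


# Synthetic check: a 2 cycles/mm sinusoid with mean 100 and amplitude 20,
# sampled every 0.05 mm over 20 mm (a whole number of cycles).
x = np.arange(400) * 0.05
demo_profile = 100.0 + 20.0 * np.sin(2 * np.pi * 2.0 * x)
demo_modulation = fundamental_modulation(demo_profile, 0.05, 2.0)
```

Dividing by the mean makes the result insensitive to an overall gain change, which is the purpose of the normalization described above.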

Detective  Quantum  Efficiency 

Detective quantum efficiency (DQE) is often measured as a function of the spatial
frequency and used to characterize the capability of an imaging system to preserve the
signal-to-noise ratio (SNR) in the input image signals. DQE is generally defined as
follows:


$$\mathrm{DQE} = \frac{\mathrm{SNR}_{out}^{2}}{\mathrm{SNR}_{in}^{2}}$$

where SNRin and SNRout are the image signal-to-noise ratios at the input and output. To
measure the DQE as a function of the spatial frequency, f (lps/mm), we used the
following expression for the frequency dependent DQE:


$$\mathrm{DQE}(f) = \left(\frac{1}{\mathrm{SNR}_{in}^{2}}\right)\left(\frac{S^{2}}{\mathrm{NPS}(f)}\right)\cdot \mathrm{MTF}(f)^{2}$$


where  NPS(f)  is  the  noise  power  spectrum,  S  is  the  average  signal  over  the  area  where 
NPS(f) is measured, and MTF(f) is the modulation transfer function. This equation is similar
to  those  previously  used  in  literature  on  DQE  measurement*^,  1  1,15 

To  compute  the  DQE(f),  NPS(f)  is  computed  from  image  data  over  an  area  with 
largely  uniform  signals  while  SNRin  is  computed  from  the  number  of  photons  which  is 
estimated  from  the  x-ray  techniques  used. 
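
As a rough illustration of how the frequency dependent DQE can be evaluated from measured quantities, the sketch below computes DQE(f) = (1 / SNRin^2) (S^2 / NPS(f)) MTF(f)^2 in Python with NumPy. The function name and the numerical values are hypothetical. It assumes that the x-ray quanta are Poisson distributed, so that SNRin^2 equals the photon fluence, and that NPS(f) is expressed in units of signal squared times area so that the result is dimensionless.

```python
import numpy as np


def dqe(mtf, nps, mean_signal, snr_in_squared):
    """DQE(f) = (1 / SNR_in^2) * (S^2 / NPS(f)) * MTF(f)^2."""
    mtf = np.asarray(mtf, dtype=float)
    nps = np.asarray(nps, dtype=float)
    return (mean_signal ** 2 / nps) * mtf ** 2 / snr_in_squared


# Hypothetical numbers: a photon fluence of 5e4 photons/mm^2 (taken as
# SNR_in^2 under the Poisson assumption) and a mean signal of 1000.
fluence = 5.0e4
mean_signal = 1000.0

# An ideal detector has MTF = 1 and a normalized NPS equal to 1 / fluence.
ideal_nps = np.full(4, mean_signal ** 2 / fluence)
ideal_dqe = dqe(np.ones(4), ideal_nps, mean_signal, fluence)
```

For the ideal detector in this example the function returns a DQE of one at every frequency, which is a convenient sanity check before applying it to measured data.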


3.  Phantom  Images 

Image quality of a mammographic unit is often checked with phantoms simulating
a breast with the major signs of disease, such as microcalcifications and masses. Various
phantoms  have  been  designed  for  this  purpose.  We  have  used  two  commercial  phantoms 
to  test  the  image  quality  and  compare  different  imaging  systems  or  technical  factors.  The 
first  one  is  a  Tissue-Equivalent  Breast  Phantom  made  by  the  Computerized  Imaging 
Reference  Systems  (CIRS),  Inc.  of  Norfolk,  Virginia.  It  consists  of  a  breast  shape 
phantom  made  of  tissue  equivalent  material,  with  a  step  wedge,  groups  of 
microcalcifications,  masses  and  fibrils  embedded  inside  the  phantom.  A  specification 
sheet for the phantom is attached as Appendix II. The step wedge consists of a square
region with a contrast corresponding to an additional 5 mm of glandular tissue over the
surrounding background tissue and a second square region with a contrast corresponding
to an additional 5 mm of fat tissue. The 12 microcalcification groups range from 0.12 mm to
0.35 mm in diameter. The 6 masses range from 1 mm to 6 mm in diameter. The fibrils
have a diameter of 8.7 µm. This phantom provides a good tool for comparing the overall
image quality of different imaging systems, techniques or technical factors.

The  second  phantom  is  the  RMI  Anthropomorphic  Breast  Phantom  (Model  165, 
Gammex-RMI).  The  phantom  consists  of  a  breast  tissue  equivalent  (50%  adipose,  50% 
glandular)  plastic  casting  and  a  high  resolution  detail  layer  enclosed  in  a  protective  acrylic 
case.  Together,  these  components  produce  radiographic  images  that  are  similar  to  a  true 
mammogram. To aid the evaluation of system performance, the phantom also produces
a  nine  level  gray  scale  step  wedge  and  a  group  of  resolution  line  pairs  in  the 
mammogram.  The  phantom  is  unique  in  that  the  characteristics  of  its  components  were 
derived  from  an  actual  mammogram.  The  three  dimensional  surface  structure  of  the 
breast  tissue  equivalent  plastic  casting  produces  the  low  and  medium  resolution  image 
detail.  High  resolution  image  details  are  produced  by  an  attenuating  coating  of  a  stable 
mercury-silver amalgam on photographic film. In addition to protecting the contents, the
acrylic case acts as a uniform attenuation layer across the entire phantom.

The advantage of using the above two phantoms is that they more or less simulate
a real breast in tissue thickness, composition and even structure. Thus they allow
performance checks under realistic imaging conditions. However, they are more suitable for
quality control than for critical comparison of image quality. The simulated
microcalcifications in the CIRS phantom have a size of at least 120 µm. In addition, the
sizes of the different groups increase in increments of 40 µm. To facilitate more critical and
quantitative comparison of image quality, we have developed a phantom referred to as the
aluminum wire phantom (AWP). The AWP is constructed of aluminum wires of 8 different
sizes: 2, 3, 4, 5, 6, 7, 8 and 16 mils. These wires were cut to lengths of about 1" and placed
in radial directions over a 1" thick Lucite block. A group of five short wire segments, cut
from the same size of wire, is placed near the tip of each wire. This phantom allows subtle
differences between different imaging systems or techniques to be studied.

For comparison studies or quality control, these phantoms are normally imaged
with the x-ray techniques often used with screen-film combinations in clinical studies.
These techniques were generally determined by setting the desired kVp and then exposing a
Min-R screen-film cassette in the automatic exposure (mAs) mode. The film was then
processed and the resulting film image examined to check the density of the image. The
kVp and mAs were recorded and used for storage phosphor imaging if the film density
appeared to be adequate. For the CIRS phantom, the proper techniques for the Min-R
screen-film combination are 208 mAs at 25 kVp with an antiscatter grid. For the
aluminum wire phantom, the techniques are usually set at 25 kVp and 26 mAs with grids.

4.  Image  Acquisition  for  ROC  Study 

In the proposed project, we will conduct an ROC study to compare the improved
storage phosphor imaging technique with the conventional screen-film technique. The
study will be based on phantom images with simulated tissue structures,
microcalcifications, and masses. To present more realistic imaging conditions, these
objects will be superimposed on structures mimicking the tissue structures of breasts. In
the second year, we finalized our design and constructed a phantom for generating
images for the ROC studies. Actual image acquisition using this phantom has started and
is expected to be completed in January of 1995. Methods for phantom construction and
image acquisition are described and discussed in this section.

Phantom  Construction 

The main body of the phantom is intended to provide a background structure over
which microcalcifications and masses will be simulated and superimposed. We have
investigated several approaches to constructing the main body of the phantom. The final
design is a large 1" thick piece of beef housed in a specially designed plastic tank. This
design is intended to simulate a well compressed breast. The meat produces a structural
pattern different from real mammograms but still provides an adequate challenge to the tasks
of detecting simulated microcalcifications or masses. The meat can be easily replaced to
produce a different structural background so that the reader has no opportunity to become
"trained" for a certain pattern. The tank is filled with pure alcohol to preserve the meat
for a longer time. The thickness of the tank is minimized to about 0.5". Thus, the phantom
has a total thickness of 1.5".

Figure 1 Layout of the film holder for the meat phantom used to acquire images for the
ROC study. A matrix of 3x4 1.5" x 1.5" film inserts can be placed on the holder
to provide 12 independent reading tasks. Examples of the microcalcification
placement are shown in windows #2, 3, 5, 7, 8, 9, 10, 11 and 12.

Microcalcifications (MCs) are simulated by a group of 5 or 6 aluminum wire
segments (spread over an area 2-3 mm wide) placed on a 1.5" x 1.5" piece of plastic
film (cut from acetate transparency). The MCs are simulated with three different sizes:
4, 5 and 6 mils (approximately 100, 125 and 150 µm). Other sizes were not used because
visual tests have indicated that MCs larger than 6 mil are always visible while those
smaller than 4 mil can never be seen under clinical imaging conditions. For each size,
many film pieces were made with the MCs located at various locations. These films can
also be rotated by multiples of 90 degrees to further randomize the location of the MC
groups. Films without any MCs were also made to simulate normal cases. A film holder is
constructed of a 3/8" thick piece of Lucite with a 3x4 matrix of 1.5" x 1.5" holes cut out.
During image acquisition, the holder, with 12 selected film inserts, is placed underneath
the meat phantom for exposure. Figure 1 shows the layout of these cutouts. Films 8 and 9
show examples of the x-ray images of the meat phantom with simulated MCs.

Statistical analysis in the proposal indicated that 240 observations (readings)
are required for the ROC study to yield reasonably significant results. Our phantom design
provides  12  "windows".  Within  each  of  these  windows,  an  observation  or  reading  can  be 
performed  to  determine  whether  and  where  there  are  MCs.  Thus,  each  image  will 
provide  12  independent  readings  and  a  total  of  20  different  images  will  be  required.  A 
total  of  240  inserts  are  required  to  acquire  these  20  images  (12  for  each  of  them).  Among 
the  240  film  inserts  used,  40  have  no  MCs,  40  have  6  mil  MCs,  80  have  5  mil  MCs  and 
80  have  4  mil  MCs.  These  inserts  are  randomly  selected  by  a  computer  program  to  be 
used  in  each  image  acquisition  procedure. 
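
The random assignment of inserts to images might look like the following sketch. It is written in Python for illustration only; the program actually used is not described in this report, and the function name, labels and seed are hypothetical. The pool sizes (40, 40, 80 and 80), the 20 images of 12 windows each, and the rotation by multiples of 90 degrees are taken from the text above.

```python
import random


def plan_acquisitions(seed=0, n_images=20, windows=12):
    """Randomly assign 240 labelled inserts to 20 images of 12 windows each."""
    pool = (["none"] * 40) + (["6 mil"] * 40) + (["5 mil"] * 80) + (["4 mil"] * 80)
    rng = random.Random(seed)
    rng.shuffle(pool)

    plan = []
    for i in range(n_images):
        inserts = pool[i * windows:(i + 1) * windows]
        # Each insert is also given a random rotation in multiples of 90 degrees.
        plan.append([(kind, rng.choice((0, 90, 180, 270))) for kind in inserts])
    return plan


plan = plan_acquisitions(seed=1)
```

Shuffling one pool and slicing it guarantees that every insert is used exactly once across the 20 images, while the mix of MC sizes within any single image is left to chance.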

The masses are simulated by circular cutouts of tape attached to the film insert.
The size and thickness of the tape cutouts can be varied to simulate masses of different sizes
and contrast. To produce marginal visibility, three different combinations of thickness
and size will be simulated and studied: (A) 2 mm wide double tape, (B) 2 mm wide single
tape and (C) 1 mm wide single tape. A total of 240 inserts are used for the mass study. Similar
to the microcalcification study, 40 inserts have no mass, 40 have type A masses, 80 have
type B masses and 80 have type C masses. Selection of film inserts for image acquisition
is controlled and randomized by a computer program.

Image  Acquisition 

A  clinical  mammographic  unit  (LoRAD  M-III)  is  used  for  image  acquisition.  The 
x-ray tube uses a molybdenum target. A 5:1 reciprocating antiscatter grid is used. With
the phantom in place, a screen-film image is first taken using a Kodak Min-R screen and
Min-R E film in the autotimed mode. The kVp is kept fixed at 25 kVp and the mAs is
controlled by photo-timing. Following the exposure, the mAs is recorded. The
exposed film is then developed and checked for proper exposure. If the film
exposure is adequate, the storage phosphor cassette is inserted and exposed using
identical x-ray techniques. The exposed storage phosphor screen is then scanned
with a pixel size of 43 µm to generate a 4096 x 5000 x 12 bit digital image. The image is
first printed with an 80 µm pixel size using a Kodak Ektascan laser printer. The film
is examined and checked for artifacts and phantom position. If acceptable, the
phantom is fitted with a different set of film inserts and imaged again following the
same procedure.

Quality  Control 

Quality control of the storage phosphor imaging system has been performed by
acquiring and visually checking images of the resolution bar pattern, the CIRS phantom, the
RMI phantom and the specially designed aluminum wire phantom. Because some
improvements have been made to the system, the criteria for the visual check have changed
slightly. In general, for the resolution bar pattern image, we expect to see and
resolve all bars at frequencies of up to 8.5 lps/mm fully and clearly. The 10 lps/mm bars
should also be fully resolved but with subdued contrast. Partial resolution of the 10
lps/mm bars is an indication of the presence of structural artifacts. The images of the CIRS
phantom provide a convenient semi-quantitative method for checking the overall quality
of the system, including the spatial resolution and noise properties. The same technique
(25 kVp, 208 mAs, 65 cm SID, with antiscatter grids) will be used in all quality control
exposures. In general, we expect to see all microcalcifications with a diameter of 200 µm
or greater and all masses with a diameter of 2 mm or greater. All fibrils should be fully
resolved. The RMI phantom offers a more qualitative check. All images should have
an apparent quality similar to that of previous images. The aluminum wire phantom provides a
finer technique to grade the system performance. At 25 kVp and 26 mAs, we expect to
resolve the 1" long wires as thin as 50 µm in diameter and small wire segments as thin as
125 µm.


Quality  control  of  mammographic  X-ray  units  and  the  screen-film  combinations 
have  been  previously  discussed1 10,1 1 1,  The  quality  of  the  mammographic  X-ray  unit  is 
routinely  monitored  and  maintained  by  the  clinical  staff. 

5.  Image  Data  Compression 

The  data  compression  procedure  is  divided  into  two  parts:  preprocessing  and  Joint 
Photographic  Experts  Group  (JPEG)  compression.  They  are  summarized  in  this  section. 
A  detailed  description  and  discussion  of  the  methods  are  presented  in  the  manuscript 
reprint  attached  as  Appendix  V. 




Preprocessing 


Before applying the JPEG compression algorithm, the image data is preprocessed
to improve its compressibility. This preprocessing operation is divided into two phases.
In the first phase, the image is segmented and cropped, and the background pixel values
are modified to minimize the storage requirements for the background. We begin the segmentation
process  by  automatically  determining  a  threshold  value  from  the  histogram  of  the 
digitized  source  image  that  indicates  the  transition  between  background  values  and  tissue 
values.  This  threshold  value  is  used  to  classify  pixels  in  the  image  as  either  tissue  or 
background  so  as  to  produce  a  two-valued  mask  corresponding  to  the  tissue-background 
separation. 
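
The segmentation step can be illustrated with the sketch below. The report does not state how the threshold is derived from the histogram, so Otsu's method is used here purely as a stand-in, and the function names, the `tissue_is_brighter` flag and the fill value are assumptions made for the example. The code is Python with NumPy.

```python
import numpy as np


def otsu_threshold(image, nbins=256):
    """Histogram threshold that maximizes the between-class variance."""
    hist, edges = np.histogram(image, bins=nbins)
    centers = 0.5 * (edges[:-1] + edges[1:])
    w0 = np.cumsum(hist).astype(float)            # pixels at or below each bin
    w1 = w0[-1] - w0                              # pixels above each bin
    csum = np.cumsum(hist * centers)
    m0 = csum / np.maximum(w0, 1.0)               # mean of the lower class
    m1 = (csum[-1] - csum) / np.maximum(w1, 1.0)  # mean of the upper class
    between = np.where((w0 > 0) & (w1 > 0), w0 * w1 * (m0 - m1) ** 2, 0.0)
    return edges[np.argmax(between) + 1]


def segment(image, tissue_is_brighter=True, fill=0):
    """Return a two-valued tissue mask and the image with a constant background."""
    image = np.asarray(image)
    t = otsu_threshold(image)
    mask = image >= t if tissue_is_brighter else image < t
    return mask, np.where(mask, image, fill)


# Toy image: a uniform background of 100 with a 4x4 "tissue" block of 2000.
demo = np.full((8, 8), 100)
demo[2:6, 2:6] = 2000
mask, cleaned = segment(demo)
```

Replacing the background with a single constant value means that background blocks carry almost no information for the subsequent coder to store. Whether tissue is brighter or darker than the background depends on how the film was digitized, hence the flag.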

The  second  phase  of  preprocessing  involves  the  application  of  a  noise-removal 
filter  to  the  tissue  area  of  the  image.  The  particular  filter  tested  here  is  a  nonlinear  order 
filter  that  was  designed  to  modify  mainly  pixels  with  large  noise  components  while 
leaving  the  majority  of  pixels  unchanged. 
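
One filter with this behavior is a conditional median filter, sketched below in Python with NumPy. It is offered only as an example of a nonlinear order filter that alters few pixels; the filter actually tested is described in the appended manuscript and may differ. The 3x3 neighborhood, the threshold parameter and the function name are assumptions.

```python
import numpy as np


def conditional_median(image, threshold):
    """Replace a pixel by its 3x3 median only if it deviates by more than threshold."""
    img = np.asarray(image, dtype=float)
    padded = np.pad(img, 1, mode="edge")
    h, w = img.shape
    # Stack the nine shifted copies that make up each 3x3 neighborhood.
    neighbors = np.stack([padded[r:r + h, c:c + w]
                          for r in range(3) for c in range(3)])
    median = np.median(neighbors, axis=0)
    outliers = np.abs(img - median) > threshold
    return np.where(outliers, median, img), outliers


# Toy image: a flat region with a single noise spike.
noisy = np.full((5, 5), 100.0)
noisy[2, 2] = 500.0
filtered, flagged = conditional_median(noisy, threshold=50.0)
```

Only the pixels flagged as outliers are changed, so the fine detail in the rest of the tissue area passes through unmodified.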

JPEG  Compression 

The  JPEG  compression  algorithm  used  in  this  study  is  a  software  package  written 
and  tested  in  our  laboratory  that  implements  the  12-bit  version  of  the  extended  JPEG 
standard.  For  this  study,  Huffman  encoding  was  used  to  encode  the  quantized 
coefficients  and  all  compression  ratios  were  based  on  this.  The  Huffman  tables  were 
derived  from  the  statistics  of  each  individual  quantized  image  rather  than  on  the  ensemble 
statistics.  Signaling  information  was  not  included  in  the  code  size  when  calculating 
compression  ratios. 

Evaluation 


The  compression  technique  is  evaluated  by  its  speed,  compression  ratio  and 
ability  to  preserve  image  information.  Measurement  of  the  compression  and 
decompression speed is straightforward. The results, however, are dependent upon
individual  images.  Therefore,  we  evaluated  and  compared  the  average  speeds  for  the  8 
images  used  in  our  study. 

The  purpose  of  preprocessing  is  to  increase  the  information  content  in  the  image. 
Thus,  we  measured  the  average  number  of  bits  for  each  8x8  pixel  block  of  tissue  area 
before  and  after  segmentation.  An  improvement  factor  is  computed  as  the  ratio  of 
average  number  of  bits  before  the  segmentation  to  that  after  the  segmentation.  We  also 
measured  the  percentage  of  image  area  with  breast  tissue  signals. 

The  compression  ratio  and  ability  for  information  preservation  are  not 
independent  of  each  other.  It  is  reasonable  to  expect  better  information  preservation 
when  the  compression  ratio  is  low  and  vice  versa.  We  used  a  just  noticeable  difference 
(JND)  study  to  quantify  the  information  preservation  as  a  function  of  the  compression 
ratio. From eight original mammograms, each of which had been compressed at five
compression ratios, a set of 40 randomized pairs of images was assembled, each pair
consisting of one laser printed non-compressed image and one compressed image. Twenty
readers were asked to view the pairs side by side and were forced to choose the compressed
image. The readers included experienced radiologists, physicists and engineers specializing in
medical imaging. Following the study, the results were sorted according to the compression
ratio and the percentage of correctly selected pairs was plotted as a function of the
compression ratio for a group of "expert readers" and a group of "average readers".
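
The tabulation of the forced-choice results can be sketched as follows. This Python example is illustrative only; the function name and the responses are made up and are not data from the study. In a two-alternative forced-choice task, a score near 50% means the readers could not tell the compressed image from the original.

```python
from collections import defaultdict


def percent_correct_by_ratio(trials):
    """trials: iterable of (compression_ratio, reader_was_correct) pairs."""
    counts = defaultdict(lambda: [0, 0])  # ratio -> [correct, total]
    for ratio, correct in trials:
        counts[ratio][0] += int(correct)
        counts[ratio][1] += 1
    return {r: 100.0 * c / n for r, (c, n) in sorted(counts.items())}


# Made-up responses for illustration only (not data from the study).
demo_trials = [(10, True), (10, False), (40, True), (40, True),
               (40, True), (40, False)]
demo_scores = percent_correct_by_ratio(demo_trials)
```

The same function can be applied separately to the trials of each reader group to obtain the two curves.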

RESULTS AND DISCUSSION


1.  Spatial  Resolution  Comparison 

Film 1 shows a storage phosphor image of a resolution test pattern (Nuclear
Associates, Model 07-553) acquired with a high resolution storage phosphor screen and
scanned with the modified storage phosphor image reader with a reduced beam spot size
of 40 µm and a pixel size of 43 µm. Notice that the 10 lps/mm bars are clearly rendered.
The horizontal resolution is slightly better than the vertical one. Film 2 shows a storage
phosphor image of the resolution test pattern acquired with a 14" x 17" standard resolution
storage phosphor screen and scanned with a commercial storage phosphor image reader
(KESPR, Eastman Kodak Company). The pixel size is 173 µm. Notice that the bar pattern is
resolved at up to 2.4 lps/mm. Although there is ample contrast, serious aliasing effects
are present at frequencies above 2.4 lps/mm. Film 3 shows a storage phosphor image of
the resolution test pattern acquired with an 8" x 10" high resolution storage phosphor screen
(Fuji HRIIIn) and scanned with a commercial storage phosphor image reader (KESPR,
Eastman Kodak Company). The pixel size is 100 µm. Notice that the bar pattern is resolved
at up to 5 lps/mm. Although there is some contrast, serious aliasing effects are present at
frequencies above 5 lps/mm.
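
The aliasing in Films 2 and 3 is consistent with the sampling limit: a pixel pitch p cannot represent frequencies above 1/(2p). The short Python sketch below (the function name is hypothetical) evaluates this limit for the three pixel sizes quoted above, giving roughly 11.6, 5.0 and 2.9 lps/mm for 43, 100 and 173 µm respectively. It considers sampling only and ignores blurring from the laser spot and the screen.

```python
def nyquist_lp_per_mm(pixel_um):
    """Highest frequency (line pairs per mm) sampled without aliasing."""
    return 1000.0 / (2.0 * pixel_um)


for pixel_um in (43, 100, 173):
    print(f"{pixel_um} um pixel: {nyquist_lp_per_mm(pixel_um):.2f} lps/mm")
```

Only the 43 µm pixel places the limit above the 10 lps/mm bar group.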

These images demonstrate that our improved ESPIS provides much better spatial
resolution than that obtained with commercial storage phosphor imaging systems. We
have also measured and compared the MTFs for the above three imaging
systems. The results are shown in Figure 2. The comparison of the MTFs also shows that
our improved ESPIS has much better spatial resolution properties. The poorer
spatial resolution of the tested commercial systems is due to two factors: the large
sampling size and the large laser beam spot size. The bar pattern image in Film 1 shows
some improvement when compared to those obtained in 1993 (see 1993 report). The
improvement may be attributed to finer focusing and a reduction of the laser spot size
from 50 µm to 40 µm. The MTF measured for the improved ESPIS is comparable to those of other
digital mammography techniques based on a combination of x-ray conversion and light
detection methods. This is probably due to the fact that image blurring from light
diffusion in x-ray phosphors sets a fundamental limit on the MTFs of all systems in this
category, including our ESPIS.

We  have  also  measured  the  MTF  as  a  function  of  the  laser  beam  power,  ranging 
from  8  to  12  mW.  The  results,  shown  in  Figure  3,  indicate  that  there  is  no  dependence  on 
the  laser  beam  power  in  the  useful  range  of  8-12  mW.  It  should  be  noted  that  outside  this 
range,  the  power  may  be  too  strong  to  avoid  overheating  or  even  burning  the  phosphor  or 
it  may  be  too  low  for  the  position  encoding  to  properly  function. 


2.  Detective  Quantum  Efficiency  (DQE) 

Figure 2 MTFs for three different storage phosphor imaging systems: "o": Fuji HRIIIn
screen scanned by an experimental storage phosphor image reader with a pixel
size of 43 µm and a spot size of 40 µm, Kodak standard resolution screen
scanned by KESPR with a pixel size of 173 µm, and "o": Fuji HRIIIn screen
scanned by KESPR with a pixel size of 100 µm. The spot size for KESPR is
estimated to be between 120 and 200 µm.


Figure 3 Effect of laser power on the MTF. The MTF was measured for various laser
powers, ranging from 8 to 16 mW: "*": 8 mW, "+": 10 mW, "x": 12 mW and
"o": 16 mW.
Figure 4 Noise power spectrum computed from image data in an area of largely uniform
brightness. The spectrum was normalized by dividing it by the square of the
average signal.
We have measured the noise power spectrum (NPS) and detective quantum
efficiency (DQE) for our ESPIS. Figure 4 shows the noise power spectrum computed
from the image data in a largely uniform region. The x-ray techniques used in acquiring
the image data were 25 kVp and 10 mAs, with grids, at an SID of 65 cm. The resulting detector
exposure is approximately 6 mR. Notice that because of image blurring, the noise
spectrum is no longer white. Instead, it decreases with increasing frequency. Figure 5
shows the DQE plotted as a function of frequency. We are currently refining our
technique for the DQE measurement. We are also trying to compare our results with
those obtained at other institutions.
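The normalization described for Figure 4 can be sketched as follows. This is a minimal illustration, not the procedure actually used for the measurements: the function name `normalized_nps`, the use of a single region without averaging over sub-regions, and the synthetic white-noise test image are all assumptions. Only the division by the square of the average signal and the 43 µm pixel size come from the text.

```python
import numpy as np

def normalized_nps(region, pixel_mm):
    """2-D noise power spectrum of a uniform region, divided by (mean signal)^2.

    Hypothetical helper: a single region is used, with no ensemble averaging.
    """
    region = np.asarray(region, dtype=float)
    mean = region.mean()
    noise = region - mean                      # remove the zero-frequency term
    ny, nx = region.shape
    power = np.abs(np.fft.fft2(noise)) ** 2
    # Scale so that the integral of the NPS over frequency equals the variance.
    nps = power * (pixel_mm * pixel_mm) / (nx * ny)
    freq_x = np.fft.fftfreq(nx, d=pixel_mm)    # cycles/mm
    freq_y = np.fft.fftfreq(ny, d=pixel_mm)
    return nps / mean ** 2, freq_x, freq_y

# Synthetic stand-in for a uniformly exposed area (white noise, so the
# spectrum is flat; a real blurred image would fall off with frequency).
rng = np.random.default_rng(0)
uniform_area = 1000.0 + rng.normal(0.0, 10.0, size=(128, 128))
nnps, fx, fy = normalized_nps(uniform_area, pixel_mm=0.043)
```

In practice the spectrum would normally be averaged over many sub-regions to reduce its statistical fluctuation; that step is omitted here for brevity.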

3.  Phantom  Images  for  Performance  Comparison 

To check the image quality following the improvement of the ESPIS, we have
acquired images with a CIRS phantom and a specially designed aluminum wire phantom.
Film 4 shows a storage phosphor image of a CIRS breast phantom acquired with a high
resolution storage phosphor screen (Fuji HRIIIn) and scanned with the improved ESPIS.
The x-ray techniques used are identical to those used in obtaining the screen-film image
in Film 5. Notice that most calcifications (with a diameter of 8 mils or greater) and
masses (with a diameter of 2 mm or greater) are well resolved. It should be noted that
compared to images obtained in 1993 (see 1993 report) the ESPIS still cannot resolve the
160 µm or smaller MCs or 1 mm mass. However, it does resolve the 200 µm MCs and
1 mm mass much more clearly, indicating that the ESPIS has been improved
somewhat.

Film 5 shows a screen-film image of the specially designed wire phantom acquired
with grids at 25 kVp and 208 mAs. The screen-film image shows the calcifications,
masses and fibrils with a sharper appearance, which is partially due to its smaller (by a
factor of 2) format. However, it does not resolve more than what can be resolved in the
storage phosphor image. This shows that at a typical clinical exposure level (as determined
by the automatic exposure control), the improved storage phosphor imaging system
approaches the conventional screen-film combinations in quality as far as detection of
microcalcifications and masses is concerned.

Film 6 shows a storage phosphor image of the specially designed wire phantom
acquired with a high resolution storage phosphor screen (Fuji HRIIIn) and scanned with
the improved storage phosphor image reader. The x-ray techniques used are identical to
those used in obtaining the screen-film image in Film 7. Notice that the 5 mil (125 µm)
wire segments are well resolved while the 4 mil (100 µm) wire segments are only
marginally resolved. All but the 2 mil (50 µm) long wires (oriented in radial direction)
are resolved.

Film 7 shows a screen-film image of the specially designed wire phantom acquired with
grids at 25 kVp and 26 mAs. The screen-film image shows the wire segments with a
sharper appearance, which is partially due to its smaller (by a factor of 2) format. This
helps resolve the 4 mil (100 µm) wire segments. However, it does not resolve finer
segments (2, 3 mil) or wires (2 mil). This shows that at a typical clinical exposure level (as
determined by the automatic exposure control), the improved storage phosphor imaging
system approaches the conventional screen-film combinations in quality as far as
detection of microcalcifications is concerned.

4.  Phantom  Images  for  ROC  Study 

We have designed a phantom for acquiring images for use in the ROC study. As
described in the Methods section, the phantom allows the structural background as well
as simulated MCs or masses to be changed. We are currently in the process of acquiring
the images for the ROC study. An example set of images is presented and discussed in
this section.

Film 8 shows a storage phosphor image of the multiple window meat phantom
designed for image acquisition in this project. Groups of five calcifications (simulated by
4, 5 or 6 mil aluminum wire segments) are present in selected windows #2, 3, 5, 7, 8, 9,
10, 11 and 12. The 5 and 6 mil calcifications are largely visible with different degrees of
difficulty, depending on the background structures. The 4 mil calcifications are generally
difficult or impossible to visualize.

Film 9 shows a screen-film (Min-R screen and Min-R E film) image of the
multiple window meat phantom designed for image acquisition in this project. Notice
that the aluminum wire segments (to simulate calcifications) have a sharper appearance in
the screen-film image. This makes the 5 and 6 mil calcifications easier to
see. However, the 4 mil calcifications (in windows #3 and 5) are still difficult to detect,
though perhaps not impossible.

Although the final comparison of our ESPIS with the conventional screen-film
combination awaits the results of the ROC study, the images we have seen suggest
that the quality of the ESPIS approaches that of screen-film systems when
imaging MCs over uniform background. However, the sharper appearance of the
screen-film images makes the task of detecting the MCs significantly easier. As a result,
detecting 4 or 5 mil MCs in the screen-film images may have a higher success rate.


5.  Data  Compression 

We  will  summarize  our  results  from  the  data  compression  study  in  regard  to  three 
topics:  speed  of  our  compression  system,  performance  of  preprocessing  and  performance 
of  the  JPEG  compression.  A  manuscript  is  attached  as  Appendix  V  for  a  more  detailed 
description  and  discussion  of  the  results. 
Speed


The  speed  of  our  compression  system  is  somewhat  image  dependent.  It  averages 
about  20  s/Mbyte  to  preprocess  and  compress  source  data.  Decompression  is  faster  at  15 
s/Mbyte  because  there  is  no  preprocessing  involved.  These  speeds  may  not  be  acceptable 
for  clinical  use  with  a  picture  archival  and  communication  system  (PACS).  However, 
they  can  be  easily  improved  by  using  dedicated  and  optimized  hardware  as  much  as 
possible. 

Preprocessing 

A summary of the effect of image segmentation on the number of bits needed to
represent tissue blocks is shown in Table 1, which lists the percentage of image area
containing tissue signals, the bits per tissue block before and after segmentation, and the
improvement factor for each of the 8 original mammograms. These
results show that the information content of the image has been improved by a factor of 2.7 on
average despite the fact that the background represented only 36% of the image area.
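The two summary figures quoted above (an average improvement factor of 2.7 and a background share of 36%) can be checked directly against the entries of Table 1. The short script below is only an arithmetic check on the tabulated values; the variable names are illustrative.

```python
# Values transcribed from Table 1, one tuple per mammogram:
# (% of area representing tissue,
#  bits per tissue block with no preprocessing,
#  bits per tissue block after segmentation)
table1 = [
    (51.1, 8.6, 30.1),
    (52.9, 8.5, 27.0),
    (70.5, 9.1, 32.3),
    (71.7, 10.0, 28.0),
    (72.4, 9.4, 23.6),
    (56.8, 7.6, 26.5),
    (70.7, 24.2, 27.4),
    (67.0, 18.0, 22.2),
]

# Improvement factor per image: bits after segmentation / bits before.
factors = [after / before for _, before, after in table1]
mean_factor = sum(factors) / len(factors)

# Average background share: 100% minus the average tissue share.
mean_tissue = sum(tissue for tissue, _, _ in table1) / len(table1)
mean_background = 100.0 - mean_tissue

print(f"mean improvement factor: {mean_factor:.1f}")
print(f"mean background area:    {mean_background:.0f}%")
```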

JPEG  Compression 

In Figure 6, the performance of image processing specialists and nonspecialists,
measured by the ratio of correct identification of compressed images, is plotted as a
function of the compression ratio. These results verify our expectation that the higher the
compression ratio is, the more the information changes and therefore the more easily one
can identify the compressed images. However, the impact of data compression on
diagnostic accuracy must be evaluated by an ROC study.
Table 1: The Effect of Image Segmentation on Tissue Block
Representation (bits per block)

Image   % of Area      Bits per Tissue Block   Bits per Tissue Block   Improvement
        Representing   (no preprocessing)      (after segmentation)    Factor
        Tissue

1       51.1            8.6                    30.1                    3.5
2       52.9            8.5                    27.0                    3.2
3       70.5            9.1                    32.3                    3.5
4       71.7           10.0                    28.0                    2.8
5       72.4            9.4                    23.6                    2.5
6       56.8            7.6                    26.5                    3.5
7       70.7           24.2                    27.4                    1.1
8       67.0           18.0                    22.2                    1.2

Figure 6 Performance of image-processing specialists and
nonspecialists for the task of identifying compressed versus
noncompressed images.


CONCLUSIONS 


In  the  second  year  of  the  grant  period,  we  have  completed  the  following  tasks: 

1. We have improved the spatial resolution capability of our storage phosphor image
reader by further reducing the laser spot size from 50 µm (obtained in the first year) to
40 µm.

2.  We  have  measured  and  compared  the  MTFs  of  the  improved  system  with  those  of 
commercial  systems. 

3.  We  have  developed  a  phantom  for  monitoring  the  overall  image  quality. 

4.  We  have  constructed  and  tested  phantoms  for  the  comparison  study. 

5.  We  have  begun  acquisition  of  storage  phosphor  and  film  images  for  the  ROC  study. 

6.  We  have  completed  the  test  of  the  compression  algorithms  developed  in  the  previous 
year. 


We have demonstrated with phantom images that the quality of the storage
phosphor images obtained with the improved imaging system is superior to that of images
obtained with current commercial systems. We have also shown that the quality
approaches that of the screen-film images. We anticipate that image acquisition will be
completed in January of 1995 and the ROC study will begin in February.
Table #1 The line pairs/mm value for each resolution
SPECIFICATIONS FOR THE CIRS BREAST PHANTOM
(Computerized Imaging Reference Systems, Inc.)


Map of Embedded Details in Phantoms.

• Stepwedge
  5 mm glandular equivalent/5 mm fat equivalent

• Masses
  6 simulated tumor masses ranging from 6 mm diameter to 1 mm diameter

• Microcalcifications (CaCO3)
  12 groupings with largest at .35 mm diameter and smallest at .12 mm
  diameter. This range of sizes has been shown in practice to be sensitive to
  system calibration and performance.

• Fibrils
  3 fibre spiculations
APPENDIX IV

CAPTIONS  FOR  FILMS 


Film 1 Storage phosphor image of a resolution test pattern (Nuclear Associates, Model
07-553) acquired with a high resolution storage phosphor screen and scanned
with the modified storage phosphor image reader with a reduced beam spot size
of 40 µm and a pixel size of 43 µm. Notice that the 10 lps/mm bars are clearly
rendered. The horizontal resolution is slightly better than the vertical one. The
image shows much better spatial resolution than those obtained with
commercial storage phosphor imaging systems.

Film 2 Storage phosphor image of a resolution test pattern (Nuclear Associates, Model
07-553) acquired with a 14"x17" standard resolution storage phosphor screen
and scanned with a commercial storage phosphor image reader (KESPR,
Eastman Kodak Company). The pixel size is 173 µm. Notice that the bar pattern is
resolved at up to 2.4 lps/mm. Although there is ample contrast, serious aliasing
effects are present at frequencies above 2.4 lps/mm.

Film 3 Storage phosphor image of a resolution test pattern (Nuclear Associates, Model
07-553) acquired with an 8"x10" high resolution storage phosphor screen (Fuji
HRIII-n) and scanned with a commercial storage phosphor image reader
(KESPR, Eastman Kodak Company). The pixel size is 100 µm. Notice that the bar
pattern is resolved at up to 5 lps/mm. Although there is some contrast, serious
aliasing effects are present at frequencies above 5 lps/mm.

Film 4 Storage phosphor image of a CIRS breast phantom acquired with a high
resolution storage phosphor screen (Fuji HRIIIn) and scanned with the improved
storage phosphor image reader. The x-ray techniques used are identical to those
used in obtaining the screen-film image in Film 5. Notice that most
calcifications (with a diameter of 8 mils or greater) and masses (with a diameter
of 2 mm or greater) are well resolved.

Film 5 Screen-film image of the specially designed wire phantom acquired with grids at
25 kVp and 208 mAs. The screen-film image shows the calcifications, masses
and fibrils with a sharper appearance, which is partially due to its smaller (by a
factor of 2) format. However, it does not resolve more than what can be
resolved in the storage phosphor image. This shows that at a typical clinical
exposure level (as determined by the automatic exposure control), the improved
storage phosphor imaging system approaches the conventional screen-film
combinations in quality as far as detection of microcalcifications and masses is
concerned.
Film 6 Storage phosphor image of the specially designed wire phantom acquired with a
high resolution storage phosphor screen (Fuji HRIIIn) and scanned with the
improved storage phosphor image reader. The x-ray techniques used are
identical to those used in obtaining the screen-film image in Film 7. Notice that
the 5 mil (125 µm) wire segments are well resolved while the 4 mil (100 µm)
wire segments are marginally resolved. All but the 2 mil (50 µm) long wires
(oriented in radial direction) are resolved.

Film 7 Screen-film image of the specially designed wire phantom acquired with grids at
25 kVp and 26 mAs. The screen-film image shows the wire segments with a
sharper appearance, which is partially due to its smaller (by a factor of 2) format.
This helps resolve the 4 mil (100 µm) wire segments. However, it does not
resolve finer segments (2, 3 mil) or wires (2 mil). This shows that at a typical
clinical exposure level (as determined by the automatic exposure control), the
improved storage phosphor imaging system approaches the conventional
screen-film combinations in quality as far as detection of microcalcifications is
concerned.

Film 8 Storage phosphor image of the multiple window meat phantom designed for
image acquisition in this project. Groups of five calcifications (simulated by 4,
5 or 6 mil aluminum wire segments) are present in selected windows #2, 3, 5, 7,
8, 9, 10, 11 and 12. The 5 and 6 mil calcifications are largely visible with different
degrees of difficulty, depending on the background structures. The 4 mil
calcifications are generally difficult or impossible to visualize.

Film 9 Min-R screen-film image of the multiple window meat phantom designed for
image acquisition in this project. Notice that the aluminum wire segments (to
simulate calcifications) have a sharper appearance in the screen-film image.
This makes the 5 and 6 mil calcifications easier to see. However, the 4
mil calcifications (in windows #3 and 5) are still difficult to detect, though
perhaps not impossible.
Joint Photographic Experts Group (JPEG) Compatible Data
Compression of Mammograms

Walter  F.  Good,  Glenn  S.  Maitz,  and  David  Gur 


We have developed a Joint Photographic Experts Group (JPEG) compatible image
compression scheme tailored to the compression of digitized mammographic images. This
includes a preprocessing step that segments the tissue area from the background,
replaces the background pixels with a constant value, and applies a noise-removal filter
to the tissue area. The process was tested by performing a just-noticeable difference
(JND) study to determine the relationship between compression ratio and a reader's
ability to discriminate between compressed and noncompressed versions of digitized
mammograms. We found that at compression ratios of 15:1 and below, image-processing
experts are unable to detect a difference, whereas at ratios of 60:1 and above they can
identify the compressed image nearly 100% of the time. The performance of less
specialized viewers was significantly lower because these viewers seemed to have
difficulty in differentiating between artifact and real information at the lower and
middle compression ratios. This preliminary study suggests that digitized mammograms
are very amenable to compression by techniques compatible with the JPEG standard.
However, this study was not designed to address the efficacy of the image compression
process for mammography, but is a necessary first step in optimizing the compression in
anticipation of more elaborate reader performance (ROC) studies.

Copyright  ©  1994  by  W.B.  Saunders  Company 

KEY WORDS: Data compression, mammography, Joint Photographic Experts Group (JPEG),
image processing, picture archiving and communication system.

IN RECENT YEARS, there has been a rapid increase in the use of mammography and a
corresponding increase in the complexity of managing mammographic studies. Many believe
that to accommodate the large number of mammographic studies now being performed, these
images must be managed in a digital format. The main difficulty with this relates to the
large volume of digital data needed to represent each image. High-resolution digital
mammograms are likely to contain 10 to 40 Mbyte each. Digital-archiving and
data-transmission systems quickly become strained when called upon to deal with a large
number of these images. The application of image-compression techniques has the
potential to greatly reduce this problem.

Because the problems of dealing with digital image data are universal, the Joint
Photographic Experts Group (JPEG) has recently defined a general-purpose
image-compression standard.1-3 Because the JPEG standard does not specifically address
mammographic images, our aim in this study was to develop a compression technique for
digitized mammograms which is compatible with the standard, but which is optimized for
mammographic images, and then to determine the level of compression that can be
achieved before the effects of compression become detectable by observers.

The JPEG algorithm is a block quantization version of the cosine transform in which
images are divided into 8- x 8-pixel blocks and the cosine transform is applied to each
block individually. The frequency coefficients calculated from the cosine transform are
quantized by dividing by values from an array of quantization factors and rounding the
quotients to integral values. The resulting array of quotients is arranged in a zigzag
order and then encoded by either Huffman coding or by adaptive arithmetic coding. In
this method, the quantization factors that are applied to the frequency coefficients
determine the compression ratio as well as the kind of information lost or artifact
introduced.
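The per-block steps just described (cosine transform, division by quantization factors with rounding, zigzag ordering) can be sketched as below. This is an illustrative sketch only, not the authors' C implementation: the function names are hypothetical, a single uniform quantization factor `q` is assumed for every coefficient, and the level shift and the Huffman or arithmetic entropy coding stage of a real JPEG codec are omitted.

```python
import numpy as np

N = 8  # JPEG block size

def dct_matrix(n=N):
    """Orthonormal DCT-II matrix C, so that C @ block @ C.T is the 2-D DCT."""
    k = np.arange(n).reshape(-1, 1)   # frequency index
    x = np.arange(n).reshape(1, -1)   # sample index
    c = np.sqrt(2.0 / n) * np.cos((2 * x + 1) * k * np.pi / (2 * n))
    c[0, :] = np.sqrt(1.0 / n)
    return c

def zigzag_indices(n=N):
    """(row, col) pairs of an n x n block in zigzag order.

    Anti-diagonals are visited in order of row + col; odd diagonals run
    with increasing row, even diagonals with increasing column.
    """
    return sorted(((r, c) for r in range(n) for c in range(n)),
                  key=lambda rc: (rc[0] + rc[1],
                                  rc[0] if (rc[0] + rc[1]) % 2 else rc[1]))

def encode_block(block, q):
    """Transform, uniformly quantize, and zigzag-order one 8 x 8 block."""
    c = dct_matrix()
    coeffs = c @ np.asarray(block, dtype=float) @ c.T
    quant = np.rint(coeffs / q).astype(int)
    return [int(quant[r, col]) for r, col in zigzag_indices()]

def decode_block(seq, q):
    """Invert encode_block up to the quantization error."""
    quant = np.zeros((N, N))
    for value, (r, col) in zip(seq, zigzag_indices()):
        quant[r, col] = value
    c = dct_matrix()
    return c.T @ (quant * q) @ c

# A smooth ramp block: most quantized coefficients become zero.
ramp = np.add.outer(np.arange(N), np.arange(N)) * 10.0
sequence = encode_block(ramp, q=16)
restored = decode_block(sequence, q=16)
```

Larger values of `q` zero out more coefficients, which is what raises the compression ratio at the cost of a larger reconstruction error.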

The main challenge in applying JPEG standard compression relates to the need to tailor
the technique to the characteristics of the particular kind of images being compressed.
There are two main areas where there is flexibility in applying the standard: in the
quantization of the frequency coefficients and in preprocessing the image before the
application of the JPEG algorithm.

In the past, most attention has focused on the design of the quantization table required
in the quantization stage of the compression process. The default quantization factors,
which are provided in the standard, are not adequate for most purposes; hence, the
standard encourages user-specified quantization tables. But the resulting image quality
is determined by the quantization table used, so the design of these tables must be
optimized for the particular compression task.

The current trend is to base quantization tables on psychophysical considerations.4,5
This approach is somewhat controversial when the compression is to be used for
radiographic images, because psychophysical compression techniques assume prior
knowledge of display characteristics and viewing conditions. The most common
alternative is to use uniform quantization. The justification for this rests on the
assumption that there is no a priori reason to believe that certain spatial frequencies
are more important than others.

In our interpretation, the JPEG standard implies that any compression process is
compatible with the standard if the encoded images it produces can be decoded with a
standard JPEG decoder. Specifically, this permits various kinds of preprocessing to be
applied to the input image data before the actual compression. Such preprocessing can
improve the overall results of the compression/decompression cycle. For example, we have
shown7 that when root mean square error (RMSE) is used as the measure of image fidelity,
at sufficiently large compression ratios, there are significant advantages to applying a
noise-removal filter to the image data before compression.

Another aspect of this preprocessing, which is somewhat unique to mammography, involves
the segmentation of the image. Much of the area in a typical digitized mammogram
corresponds to background pixels rather than to tissue pixels. These background pixels
contain noise that is typically difficult to compress. To optimize compression in this
case, it is important to minimize the amount of data required to represent these
background pixels in the encoded image. This can be done in a JPEG-compatible manner by
first cropping the image to the smallest rectangular area that contains all of the
tissue, and then identifying the remaining background pixels and replacing them with a
constant value.


With  these  considerations  in  mind,  in  our 
laboratory  we  have  developed  and  begun  testing 
software  for  JPEG-compatible  compression  of 
digitized  mammograms. 

MATERIALS  AND  METHODS 

As  described  in  more  detail  below,  our  compression 
software  is  consistent  with  the  extended  12-bit  version  of  the 
JPEG  standard  and  uses  a  set  of  preprocessing  routines 
designed  specifically  for  mammograms.  This  software  was 
written  in  C  to  be  transportable,  but  all  of  the  processing 
reported  in  this  paper  was  performed  on  a  standard  33-MHz 
486  personal  computer.  We  have  tested  this  software  by 
using  it  to  compress  a  set  of  digitized  mammograms  at 
various  levels  of  quantization,  and  then  evaluating  the 
quality of these compressed images by performing a just-noticeable
difference (JND) study.

Preprocessing. Before applying the JPEG compression
algorithm, the image data is preprocessed to improve its
compressibility. This preprocessing operation is divided into
two phases: first the image is segmented and cropped and
the background pixel values are modified to minimize the
storage requirements for the background, and then a
noise-removal filter is applied to the tissue pixels.

We begin the segmentation process by automatically determining a threshold value from
the histogram of the digitized source image that indicates the transition between
background values and tissue values. This threshold value is used to classify pixels in
the image as either tissue or background so as to produce a two-valued mask
corresponding to the tissue-background separation. The thresholded image normally
consists of a large contiguous region corresponding to tissue, possibly containing small
holes, as well as smaller isolated clusters of points from the noisier background
regions.

An  initial  “seed”  point  in  the  tissue  area  is  automatically 
selected  and  grown  to  encompass  the  largest  possible 
connected  region.  This  produces  a  binary  mask  matching 
the  tissue  pixels  in  the  source  image.  This  mask  is  expanded 
by 15 pixels; ie, any pixel that is within 15 pixels of a tissue
pixel  is  appended  to  the  mask.  This  expansion  is  effected  by 
convolving  the  binary  mask  with  a  31-  x  31-pixel  uniformly 
weighted  kernel  and  then  assigning  all  nonzero  pixels  the 
value  1. 

We then convolve this binary mask with a 17- x 17-pixel uniformly weighted kernel to
smooth the edge between the mask (pixels = 1) and background (pixels = 0). This expanded
and smoothed mask, whose pixel values represent an interpolation factor λ, has a value
of 1 for every soft-tissue pixel and every background pixel within 7 pixels of a tissue
pixel, and goes smoothly to zero over a distance of 16 pixels as the distance from
tissue pixels increases. This mask and the source image are both cropped to a
rectangular area that excludes as many zero pixels as possible while retaining all
nonzero pixels.
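The expansion and smoothing of the tissue mask can be sketched with two uniformly weighted box filters, as described above. The sketch below is an assumption-laden illustration rather than the authors' C code: the helper names `box_sum` and `expand_and_smooth` are hypothetical, edge replication at the image border is an assumed choice (the text does not say how borders are handled), and thresholding, region growing and cropping are left out.

```python
import numpy as np

def box_sum(a, size):
    """Sum of `a` over a size x size window centred on each pixel.

    Uses an integral image; the border is handled by edge replication,
    which is an assumption made for this sketch.
    """
    r = size // 2
    padded = np.pad(np.asarray(a, dtype=float), r, mode="edge")
    ii = np.zeros((padded.shape[0] + 1, padded.shape[1] + 1))
    ii[1:, 1:] = padded.cumsum(axis=0).cumsum(axis=1)
    h, w = np.asarray(a).shape
    return (ii[size:size + h, size:size + w] - ii[:h, size:size + w]
            - ii[size:size + h, :w] + ii[:h, :w])

def expand_and_smooth(tissue_mask, grow=31, smooth=17):
    """Grow a binary tissue mask by 15 pixels, then smooth its edge.

    Returns the interpolation factor: 1 on tissue and nearby background,
    falling to 0 further away.
    """
    # 31 x 31 uniform kernel, then set every nonzero pixel to 1.
    expanded = (box_sum(tissue_mask, grow) > 0.5).astype(float)
    # 17 x 17 uniformly weighted (averaging) kernel.
    return box_sum(expanded, smooth) / (smooth * smooth)

# A single "tissue" pixel in the middle of an empty image.
tissue = np.zeros((101, 101))
tissue[50, 50] = 1
factor = expand_and_smooth(tissue)
profile = factor[50, 50:76]   # values moving away from the tissue pixel
```

With these kernel sizes the factor stays at 1 out to 15 - 8 = 7 pixels from tissue and reaches 0 beyond 15 + 8 = 23 pixels, which matches the 16-pixel transition described in the text.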

A constant value, which is to be used in modifying background pixels, is determined by
averaging the background pixels that are near the skin boundary. Finally, the output
image is created pixel by pixel by interpolating between the source image and the
constant value using the formula

output = λ · source + (1 - λ) · constant     (1)

where λ is the interpolation factor given by the expanded and smoothed mask.

The resulting image has the following characteristics: (1) any 8- x 8-pixel block that
contains a tissue pixel will remain unchanged from the source image, eliminating the
possibility that the segmentation process will alter the effect of compression on any
tissue pixel; and (2) background pixels will have been modified to change in a smooth
and continuous manner from their original values to the prescribed constant value as a
function of distance from the nearest tissue pixel, thus reducing the possibility that
the segmentation process will introduce an edge artifact at the tissue/background
boundary.
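The pixel-by-pixel interpolation of formula (1) is a simple weighted blend. The sketch below assumes the interpolation factor from the smoothed mask is already available as an array (called `lam` here) and that the background constant has been computed separately; the function name and the sample values are illustrative only.

```python
import numpy as np

def blend_background(source, lam, constant):
    """Apply output = lam * source + (1 - lam) * constant to every pixel.

    `lam` is 1 where the source must be kept unchanged, 0 where the pixel
    is replaced by `constant`, and in between across the transition zone.
    """
    source = np.asarray(source, dtype=float)
    lam = np.asarray(lam, dtype=float)
    return lam * source + (1.0 - lam) * constant

# One image row running from tissue (left) into noisy background (right).
row = np.array([900.0, 880.0, 310.0, 295.0, 305.0])
lam_row = np.array([1.0, 1.0, 0.5, 0.25, 0.0])
out_row = blend_background(row, lam_row, constant=300.0)
```

Pixels with a factor of 1 come through untouched, so any 8 x 8 block containing tissue is left as in the source, while the background noise is progressively flattened toward the constant.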

The second preprocessing phase involves the application of a filter to the tissue area
of the image. The particular filter tested here is a nonlinear order filter7-8 that was
designed to modify mainly pixels with large noise components while leaving the majority
of pixels unchanged. This filter operates as follows. The source image is first
decomposed into a high-frequency component and a low-frequency component by unsharp
masking, a process whereby a low-frequency version of the source image, obtained by
smoothing the source, is subtracted from the source image to produce an edge-enhanced
(high-frequency) version of the source. Pixel values in each 3- x 3-pixel block of the
high-frequency component are sorted. If the center pixel of the block is the highest
value in the block, it is assigned the value of the next lower pixel. Similarly, if it
is the lowest value, it is assigned the next higher value. Otherwise, the pixel is left
unchanged. This results in a rather innocuous process of eliminating one-pixel maxima or
minima from the high-frequency component, while leaving most pixels unchanged. The small
number of pixels changed are likely to represent noise, as their values are either
greater or less than the values of all their immediate neighbors. Microcalcifications in
the image are generally preserved because they rarely consist of a single pixel.
Finally, the filtered high-frequency component is added back to the low-frequency
component to produce the filtered source image.
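The order filter described above can be sketched as follows. This is a hedged illustration, not the authors' implementation: the function names are hypothetical, the smoothing used for unsharp masking is assumed to be a 5 x 5 box average (the text does not give the kernel), and borders are handled by edge replication, which in this sketch leaves border pixels unchanged.

```python
import numpy as np

def box_mean(a, size):
    """Mean over a size x size window (edge-replicated border)."""
    r = size // 2
    p = np.pad(a, r, mode="edge")
    h, w = a.shape
    total = np.zeros((h, w))
    for dy in range(size):
        for dx in range(size):
            total += p[dy:dy + h, dx:dx + w]
    return total / (size * size)

def clip_single_pixel_extrema(high):
    """Pull a pixel that is above (below) all 8 neighbours to their max (min)."""
    h, w = high.shape
    p = np.pad(high, 1, mode="edge")
    # Stack the 8 neighbours of every pixel; the centre itself is excluded.
    neighbours = np.stack([p[dy:dy + h, dx:dx + w]
                           for dy in range(3) for dx in range(3)
                           if (dy, dx) != (1, 1)])
    n_max = neighbours.max(axis=0)
    n_min = neighbours.min(axis=0)
    out = high.copy()
    is_peak = high > n_max      # strictly the highest value in its 3 x 3 block
    is_pit = high < n_min       # strictly the lowest value in its 3 x 3 block
    out[is_peak] = n_max[is_peak]
    out[is_pit] = n_min[is_pit]
    return out

def order_filter(source, smooth_size=5):
    """Unsharp-mask split, extremum clipping of the high band, recombination."""
    source = np.asarray(source, dtype=float)
    low = box_mean(source, smooth_size)   # low-frequency component
    high = source - low                   # high-frequency component
    return low + clip_single_pixel_extrema(high)

# A one-pixel noise spike is removed; a 2 x 2 bright cluster is kept.
spike = np.full((11, 11), 100.0)
spike[5, 5] = 200.0
cluster = np.full((12, 12), 100.0)
cluster[5:7, 5:7] = 200.0
spike_filtered = order_filter(spike)
cluster_filtered = order_filter(cluster)
```

The two test images illustrate the behaviour claimed in the text: an isolated single-pixel extremum is suppressed, while a feature spanning more than one pixel, such as a small microcalcification, passes through unchanged.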

These  preprocessing  operations  require  two  convolutions 
with  uniform  kernels,  two  applications  of  a  threshold,  a 
region  growing  step,  an  interpolation  step,  and  application 
of  the  nonlinear  filter.  In  our  implementation,  which  does 
not  require  manual  intervention,  all  of  these  steps  are 
performed  in  approximately  one  third  of  the  time  of  the 
compression  itself. 

Image compression. The JPEG compression algorithm used in this study is a software
package written and tested in our laboratory that implements the 12-bit version of the
extended JPEG standard. For this study, Huffman encoding was used to encode the
quantized coefficients and all compression ratios were based on this. The Huffman tables
were derived from the statistics of each individual quantized image rather than from the
ensemble statistics. Signaling information was not included in the code size when
calculating compression ratios.

Results  reported  in  this  paper  are  based  on  uniform 
quantization  of  the  frequency  coefficients.  As  discussed 
below,  any  quantization  scheme  that  preferentially  de¬ 
grades  the  high-frequency  components  (ie,  most  of  the 


psychophysical  quantization  schemes)  may  be  contraindi¬ 
cated  in  the  compression  of  mammographic  images. 
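
Uniform quantization can be sketched as below. The point of the example is that every frequency coefficient is divided by the same step, so high-frequency terms are not penalized more than low-frequency ones. How the paper's quantization factor maps onto a step size is not stated in this section, so the `step` parameter and both function names are assumptions made for illustration.

```python
import math

def quantize_uniform(coefficients, step):
    """Map each coefficient to the index of the nearest multiple of step.

    The same step is used for every coefficient, regardless of frequency.
    Ties are rounded upward.
    """
    return [math.floor(c / step + 0.5) for c in coefficients]

def dequantize_uniform(indices, step):
    """Reconstruct approximate coefficient values from quantizer indices."""
    return [i * step for i in indices]
```

A psychophysical scheme would instead use a larger step for higher frequencies, which is the behavior the authors caution against for mammograms.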

Evaluation  of  image-compression  technique.  In  an  effort 
to  understand  the  viability  of  the  methods  described  above, 
we  carefully  selected  eight  high-quality  cases  that  spanned 
the  range  of  image  characteristics  customarily  encountered 
in  mammography,  with  some  preference  being  given  to 
images  containing  subtle  pathology.  These  cases  were  digi¬ 
tized  and  the  digital  data  was  compressed  over  a  wide  range 
of  compression  ratios  by  applying  the  above  techniques. 

The digitization process used a high-resolution, high-contrast
sensitivity laser film digitizer (Lumisys, Sunnyvale,
CA) that produces a scan matrix of 4,000 x 5,000 pixels for
an 8- x 10-in film by digitizing at a 50-µm sampling interval.
This  pixel  resolution  results  in  a  Nyquist  spatial  frequency 
of  10  cycles/mm,  which  preserves  the  signal  resolution  of 
the  analog  film.  The  modulation  transfer  function  (MTF)  at 
the  Nyquist  frequency  is  30%  in  the  fast-scan  direction  and 
38% in the slow-scan direction. The 16-bit A/D converter of
the  digitizer  permits  a  density  measurement  with  an  RMSE 
of  less  than  0.01.  This  represented  less  than  one  third  of  the 
noise  present  in  the  conventional  film  at  densities  exceeding 
0.4. 
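
The sampling figures quoted above follow from simple arithmetic, sketched here. The function names are hypothetical, and the computed matrix is slightly larger than the nominal 4,000 x 5,000 quoted in the text because the film dimensions are converted exactly from inches.

```python
MM_PER_INCH = 25.4

def nyquist_cycles_per_mm(sampling_interval_um):
    """Nyquist frequency is one cycle per two sampling intervals."""
    return 1.0 / (2.0 * sampling_interval_um / 1000.0)

def scan_matrix(width_in, height_in, sampling_interval_um):
    """Pixel counts for a film of the given size in inches."""
    pixels_per_mm = 1000.0 / sampling_interval_um
    return (
        round(width_in * MM_PER_INCH * pixels_per_mm),
        round(height_in * MM_PER_INCH * pixels_per_mm),
    )
```

With a 50-µm interval, `nyquist_cycles_per_mm(50)` gives 10 cycles/mm, matching the value stated in the text.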

Each  digitized  image  was  compressed  at  five  different 
compression  ratios.  Because  it  is  the  degree  of  quantization 
rather  than  the  compression  ratio  itself  that  determines  the 
degradation  of  an  image  by  compression,  the  quantization 
factor  was  used  as  an  independent  variable  and  not  the 
actual  compression  ratio.  The  five  quantization  factors  we 
applied  were  40,  60,  80,  100,  and  120,  which  produced 
average  compression  ratios  between  15:1  and  60:1.  The 
lowest  quantization  factor  is  at  a  level  below  which  it  is  not 
expected  that  one  could  detect  any  effect.  The  highest 
quantization  factor  produced  a  mean  compression  ratio  of 
60:1,  which  is  higher  than  what  is  required  to  permit  the 
efficient  handling  of  mammographic  images  with  current 
digital  technology,  and  it  is  in  the  range  where  studies  of 
other  algorithms  applied  to  other  image  types  have  shown 
significant deterioration in image quality.9,10 Note that these
particular  quantization  factors  are  only  appropriate  for 
images  digitized  at  the  resolution  and  with  the  noise 
characteristics  under  consideration  in  this  study. 

All  images  used  in  this  study,  both  the  compressed 
versions  as  well  as  the  original  digitized  data,  were  printed 
on  film  with  a  laser  film  printer  (Eastman-Kodak,  Roches¬ 
ter,  NY).  These  laser-printed  films  were  used  for  all  image 
comparisons.  It  was  not  possible  for  us  to  perform  a 
meaningful  comparison  of  the  laser-printed  images  to  the 
original  (nondigitized)  images  because  our  laser  printer  was 
only capable of printing at a pitch of ~80 µm/pixel, and this
resulted in substantial (1.6x) magnification of the laser-printed
images. However, comparing two images that have
both  been  enlarged  to  the  same  degree  is  possible,  and  if  the 
enlargement  had  any  effect,  we  believe  it  would  have  been 
to  make  it  easier  for  readers  to  discriminate  between 
noncompressed  images  and  compressed  images.  The  overall 
digitizer-processor  response  was  calibrated  to  accurately 
duplicate  the  densities  and  contrast  in  the  original  film. 
Noise  contributed  by  the  laser  printer  is  substantially  less 
than  image  noise  caused  by  quantum  mottle  and  film 
granularity  in  the  original  film. 




GOOD,  MAITZ,  AND  GUR 


Table 1: The Effect of Image Segmentation on Tissue Block
Representation (bits per block)

         % of Area       Bits per Tissue    Bits per Tissue
         Representing    Block (no          Block (after       Improvement
Image    Tissue          preprocessing)     segmentation)      Factor

1        51.1             8.6               30.1               3.5
2        52.9             8.5               27.0               3.2
3        70.5             9.1               32.3               3.5
4        71.7            10.0               28.0               2.8
5        72.4             9.4               23.6               2.5
6        56.8             7.6               26.5               3.5
7        70.7            24.2               27.4               1.1
8        67.0            18.0               22.2               1.2

From the eight originals, each of which had been compressed
at five compression ratios, a set of 40 randomized
pairs of images, each consisting of one laser-printed noncompressed
image and one compressed version of the original,
was assembled. Readers were asked to view the pairs side
by  side  and  were  forced  to  choose  the  image  they  believed  to 
have  been  compressed.  Readers  were  allowed  to  spend  as 
much  time  as  desired  on  each  pair.  The  reading  environ¬ 
ment  was  equivalent  to  that  normally  used  for  reading 
mammograms  and  readers  were  permitted  to  use  magnify¬ 
ing  lenses. 

Altogether,  20  readers  participated  in  these  readings. 
They  included  radiologists  experienced  in  mammography  as 
well  as  physicists  and  engineers  specializing  in  medical 
imaging. 

RESULTS 

The  performance  of  our  compression  system 
is  somewhat  image  dependent,  but  for  the 
ensemble  of  images  reported  on  here,  we  aver¬ 
aged  about  20  s/Mbyte  to  preprocess  and  com¬ 
press  source  data.  Because  decompression  does 
not  involve  any  steps  similar  to  preprocessing, 
we  were  able  to  perform  it  at  a  rate  of  15 
s/Mbyte  of  output  data.  Although  these  speeds 
are  not  acceptable  for  a  clinical  picture  archiv¬ 
ing  and  communication  system  (PACS)  environ¬ 
ment,  we  believe  they  can  be  easily  improved  by 
adopting  hardware  optimized  for  the  task. 

A  summary  of  the  impact  of  image  segmenta¬ 
tion  on  the  number  of  bits  needed  to  represent 
tissue  blocks  is  presented  in  Table  1.  All  images 
included  in  this  table  had  previously  been 
cropped  to  a  rectangular  area  whose  border  was 
within  1  cm  of  tissue  pixels  in  each  dimension. 
Column  2  gives  the  fraction  of  the  cropped 
image  occupied  by  tissue  pixels.  Columns  3  and 
4  give  the  average  number  of  bits  used  to 
represent  each  8-  x  8-pixel  block  of  tissue  pixels 
before  and  after  application  of  the  segmenta¬ 


tion  process,  for  an  overall  compression  ratio  in 
both  cases  of  40:1.  It  can  be  seen  from  this  table 
that  our  segmentation  process  increased  the 
number  of  bits  used  to  represent  each  tissue 
pixel  by  a  factor  of  2.7  on  the  average  despite 
the  fact  that  the  background  represented  only 
35.9%  of  the  image  area. 
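
The summary figures quoted from Table 1 can be checked directly from the tabulated values. The sketch below recomputes the per-image improvement factor as the ratio of bits per tissue block after and before segmentation, then averages it. The variable and function names are ours, and the data are transcribed from Table 1.

```python
# (image, % tissue area, bits/block without preprocessing,
#  bits/block after segmentation), transcribed from Table 1.
TABLE_1 = [
    (1, 51.1, 8.6, 30.1),
    (2, 52.9, 8.5, 27.0),
    (3, 70.5, 9.1, 32.3),
    (4, 71.7, 10.0, 28.0),
    (5, 72.4, 9.4, 23.6),
    (6, 56.8, 7.6, 26.5),
    (7, 70.7, 24.2, 27.4),
    (8, 67.0, 18.0, 22.2),
]

def mean(values):
    """Arithmetic mean of a non-empty sequence."""
    return sum(values) / len(values)

def improvement_factors(rows):
    """Bits per tissue block after segmentation divided by bits before."""
    return [after / before for _, _, before, after in rows]

mean_factor_all = mean(improvement_factors(TABLE_1))
mean_factor_first_six = mean(improvement_factors(TABLE_1[:6]))
mean_background_percent = 100.0 - mean([row[1] for row in TABLE_1])
```

Rounded to one decimal place, these reproduce the 2.7 average improvement and the 35.9% mean background area stated above, as well as the 3.2 average for images 1 through 6 given in the Discussion.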

From  approximately  800  observations  in  the 
JND  experiment,  average  observers  correctly 
detected  42%,  55%,  47%,  61%,  and  75%  of  the 
images  for  the  15:1,  23:1,  34:1,  45:1,  and  60:1 
compression  ratios,  respectively.  However,  when 
we  looked  at  the  performance  of  individual 
readers,  we  found  that  there  were  two  readers 
who  were  particularly  good  at  making  the  dis¬ 
criminations  required  in  this  study.  These  read¬ 
ers  were  a  physicist  and  an  electrical  engineer 
who  were  both  specialists  in  image  processing 
and  were  familiar  with  the  particular  artifacts 
caused  by  this  type  of  image  compression.  These 
image-processing  specialists  averaged  discrimi¬ 
nation  rates  of  50%,  67%,  75%,  87%,  and  100% 
for  the  above  compression  ratios.  Their  surpris¬ 
ing  consistency  with  each  other  and  the  mono¬ 
tonic  trend  of  their  performance  led  us  to 
believe  that  we  were  actually  measuring  a  mean¬ 
ingful  threshold.  A  plot  of  the  results  of  this 
JND  study  is  shown  in  Fig  1. 
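
Because each trial is a forced choice between two images, a reader who cannot see any difference is expected to score about 50%. One way to judge whether an observed rate departs from chance is an exact binomial tail probability, sketched below. This analysis is not part of the study as reported. It assumes roughly 160 observations per compression level (800 divided by five levels) and treats observations as independent, which pooled readings from the same readers may not be.

```python
from math import comb

def binomial_tail(successes, trials, p=0.5):
    """Probability of at least `successes` correct answers by guessing.

    Computes P(X >= successes) for X ~ Binomial(trials, p).
    """
    return sum(
        comb(trials, k) * p**k * (1 - p) ** (trials - k)
        for k in range(successes, trials + 1)
    )
```

For example, `binomial_tail(round(0.75 * 160), 160)` gives the probability of scoring 75% or better at the 60:1 level by guessing alone, under the assumptions stated above.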

DISCUSSION 

Performance  of  compression  process.  The  re¬ 
lationship  between  the  compressibility  of  tissue, 
the  compressibility  of  the  background,  and  the 
relative  areas  between  tissue  and  background 


Fig  1.  Performance  of  image-processing  specialists  and 
nonspecialists  for  the  task  of  identifying  compressed  versus 
noncompressed  images. 


DATA  COMPRESSION  OF  MAMMOGRAMS 




can be expressed as

$$\frac{A_t + A_b}{C} = \frac{A_t}{C_t} + \frac{A_b}{C_b} \tag{2}$$

in which $C_t$ and $C_b$ are, respectively, the compression
ratios for $A_t$ and $A_b$, the areas corresponding
to tissue and background, and $C$ is the
overall compression ratio for the image. The
potential improvement factor in the number of
bits per tissue pixel that can be achieved through
segmentation, assuming the background is compressed
to a negligible size after segmentation,
is

$$\frac{A_t + A_b \, \dfrac{C_t}{C_b}}{A_t}$$

where $C_b$ is the compression ratio of the background
without segmentation. In all cases, after
segmentation  we  were  able  to  compress  the 
background  at  a  ratio  of  256:1,  which  corre¬ 
sponds  to  using  3  bits  to  represent  each  8-  x 
8-pixel  block.  Theoretically  in  some  unusual 
cases,  it  would  be  possible  to  use  only  2  bits  per 
block,  but  this  is  an  absolute  limit  for  the  JPEG 
algorithm  (sequential  discrete  cosine  transform 
[DCT]  and  Huffman  encoding).  The  actual 
improvements  in  bits  per  tissue  pixel  are  indi¬ 
cated  in  Table  1,  column  5. 
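
The step from the area relationship to the potential improvement factor can be written out as follows. This is our reading of the argument rather than a derivation given in the source. The symbol $b$ (bits per source pixel) is introduced only for the derivation and cancels in the result; $A_t$, $A_b$, $C_t$, $C_b$, and $C$ are as defined in the text.

```latex
\begin{align*}
\frac{b\,(A_t + A_b)}{C}
  &= \frac{b\,A_t}{C_t} + \frac{b\,A_b}{C_b}
  && \text{(total encoded size without segmentation)} \\[4pt]
\text{bits per tissue pixel, before}
  &= \frac{b}{C_t} \\[4pt]
\text{bits per tissue pixel, after}
  &\approx \frac{b\,(A_t + A_b)}{C\,A_t}
  && \text{(same } C \text{, background negligible)} \\[4pt]
\text{improvement factor}
  &\approx \frac{C_t\,(A_t + A_b)}{C\,A_t}
   = 1 + \frac{A_b\,C_t}{A_t\,C_b}
\end{align*}
```

With the values reported for image 6 later in this section (tissue fraction 0.568, background fraction 0.432, $C_t = 51.8$, $C_b = 14.9$), the expression evaluates to about 3.6. This is consistent with the measured factor of 3.5, given that the segmented background still occupies a small part of the encoded file.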

Note  that  images  7  and  8  behaved  somewhat 
differently  than  the  other  six  images.  The  back¬ 
grounds  of  these  images  were  so  dense  that  they 
were beyond the range of our digitizer. Consequently,
many background pixel values were
saturated  at  the  maximum  output  of  our  digi¬ 
tizer,  and  hence,  the  apparent  noise  in  the 
background  was  greatly  reduced.  Saturation  of 
the  background  had  much  the  same  effect  as  our 
segmentation  process,  but  has  the  potential  to 
also  alter  tissue  pixels  in  an  undesirable  way. 
We  have  included  these  particular  images  after 
verifying  that  the  tissue  pixels  themselves  were 
not  saturated. 

For  images  1  through  6,  our  segmentation 
process  increased  the  number  of  bits  used  to 
represent  each  tissue  pixel  by  a  factor  of  3.2  on 
the  average.  If  images  7  and  8  are  also  included, 
this  effect  is  reduced  to  2.7.  The  average  area 
occupied  by  tissue  pixels  in  cropped  images  1 


through  6  was  62.6%.  It  can  be  seen  that  for 
images  in  which  the  background  is  not  satu¬ 
rated,  the  average  improvement  in  bits  per 
tissue  pixel  is  much  larger  than  what  would  be 
suggested  simply  by  the  relative  areas  occupied 
by  the  tissue  pixels  and  background  pixels. 

We  illustrate  what  happens  during  the  seg¬ 
mentation  process  by  using  image  number  6 
from  our  study  as  a  specific  example.  This 
image,  which  is  partially  shown  in  Fig  2,  is 
typical  of  what  we  have  observed  in  general. 
Initially,  this  mammogram  was  digitized  to  a 
12.3-Mbyte  file.  Cropping  reduced  this  by  5.7% 
to  11.6  Mbyte,  and  of  this  file,  56.8%  of  the 
pixels  corresponded  to  tissue. 

A plot of the compressibility of various regions
of this image, which are identified in Fig 2,
is  presented  in  Fig  3.  The  independent  variable 
on  this  graph,  the  quantization  factor,  is  speci¬ 
fied  at  the  time  of  compression  and  directly 
controls  the  loss  of  information  during  the 
compression  process.  It  can  be  seen  that  differ¬ 
ent  regions  of  the  image  vary  widely  with  re¬ 
spect  to  their  compressibility  and  that  the  back¬ 
ground  is  significantly  less  compressible  than 
either  of  the  tissue  regions.  In  many  cases  such 
as  this  one,  because  the  background  is  very 
noisy,  it  is  the  least  compressible  part  of  the 
image.  When  the  nonsegmented  version  of  this 
image  was  compressed  at  a  ratio  of  25:1  (0.48 
bits  per  pixel),  the  compression  ratio  of  the 
background  was  only  14.9:1  compared  with 
51.8:1 (0.23 bits per pixel) for the tissue pixels.
The  encoded  file  contained  0.464  Mbyte  of  data 
of  which  0.337  Mbyte  was  needed  to  describe 
the  background  and  the  remaining  0.127  Mbyte 
represented  tissue.  In  other  words,  despite  the 
fact  that  the  background  was  only  43%  of  the 
source  image,  more  than  72%  of  the  data  in  the 
encoded  file  was  used  to  describe  the  back¬ 
ground.  After  the  image  was  preprocessed  and 
compressed  at  the  same  25:1  compression  ratio, 
the constant background in the segmented image
was now compressed at a ratio of 256:1 (0.05
bits  per  pixel)  and  the  tissue  area  was  com¬ 
pressed  at  a  ratio  of  14.8:1  (0.81  bits  per  pixel). 
The  background  was  reduced  to  only  0.02  Mbyte 
or  about  4%  of  the  encoded  data.  The  net  effect 
was  that  segmentation  increased  the  amount  of 
data  used  to  represent  tissue  pixels  by  a  factor 
of 3.5.

Fig 2. Image showing the three typical regions of interest
in different areas representing tissue and background.

Figure 4 compares the overall compressibility of the
segmented image to that of the nonsegmented image.
The compressibility of a
chest  image  is  also  included  for  comparison. 
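
The bit budget quoted for image 6 can be reproduced with a few lines of arithmetic. The sketch below is only a consistency check on the reported numbers. It assumes 12 bits per source pixel, which is implied by the pairing of 25:1 with 0.48 bits per pixel, and all names are hypothetical.

```python
BITS_PER_SOURCE_PIXEL = 12  # assumption: implied by 25:1 <-> 0.48 bits/pixel

def overall_ratio(tissue_fraction, tissue_ratio, background_ratio):
    """Overall compression ratio from per-region ratios and area fractions."""
    background_fraction = 1.0 - tissue_fraction
    return 1.0 / (
        tissue_fraction / tissue_ratio
        + background_fraction / background_ratio
    )

def encoded_mbyte(source_mbyte, area_fraction, ratio):
    """Encoded size of a region covering area_fraction of the source."""
    return source_mbyte * area_fraction / ratio

def bits_per_block(ratio, block_pixels=64):
    """Bits spent on one 8 x 8 block at a given compression ratio."""
    return BITS_PER_SOURCE_PIXEL * block_pixels / ratio

CROPPED_MBYTE = 11.6     # cropped size of image 6
TISSUE_FRACTION = 0.568  # fraction of cropped pixels that are tissue

# Without segmentation: tissue at 51.8:1, background at 14.9:1.
background_before = encoded_mbyte(CROPPED_MBYTE, 1 - TISSUE_FRACTION, 14.9)
tissue_before = encoded_mbyte(CROPPED_MBYTE, TISSUE_FRACTION, 51.8)
ratio_before = overall_ratio(TISSUE_FRACTION, 51.8, 14.9)

# With segmentation: tissue at 14.8:1, background at 256:1.
background_after = encoded_mbyte(CROPPED_MBYTE, 1 - TISSUE_FRACTION, 256.0)
ratio_after = overall_ratio(TISSUE_FRACTION, 14.8, 256.0)

# The gain in bits per tissue pixel is the ratio of the two tissue ratios.
tissue_gain = 51.8 / 14.8
```

Both `ratio_before` and `ratio_after` come out close to 25:1, the encoded background shrinks from roughly 0.34 Mbyte to roughly 0.02 Mbyte, and `bits_per_block(256)` gives the 3 bits per block mentioned in the text.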

Figure  5  shows  in  more  detail  the  relative 
number  of  bits  per  tissue  pixel  between  the 


Fig  3.  Comparison  of  the  compressibility  of  the  three 
regions  marked  in  Fig  2,  with  the  compressibility  of  the  image 
as  a  whole. 


segmented  and  nonsegmented  images.  The  ac¬ 
tual  pattern  of  the  compressibility  of  various 
parts  of  the  image  is  also  of  interest  because  it 
provides  information  that  eventually  will  be 
needed  for  optimizing  the  relative  compression 


Fig  4.  Comparison  of  the  compressibility  of  mammograms 
with  and  without  preprocessing  to  the  compressibility  of 
digitized chest images.

Fig 5. Comparison of the mean number of bits used to
represent  a  tissue  pixel  for  segmented  versus  nonsegmented 
images. 


of  different  tissues.  Figure  6,  A  and  B  show  the 
pattern  of  bits  per  pixel  with  and  without 
preprocessing.  It  is  clear  from  a  comparison  of 
these  images  that  preprocessing  results  in  a 
dramatic  shift,  from  bits  being  used  to  represent 
background  to  being  used  to  represent  tissue. 
One  further  observation  from  Fig  6B  is  that  a 
large  fraction  of  the  bits  used  to  represent  tissue 
are  in  fact  used  to  represent  only  the  regions 
near  skin.  This  observation  is  consistent  through¬ 
out  the  ensemble  of  images  we  have  studied  to 
date.  We  believe  it  is  mostly  caused  by  the  low 
compressibility  of  the  high-frequency  structures 
in  the  near-skin  regions  and  to  a  lesser  extent 
caused  by  the  high  density  of  the  skin  areas  on 
the  original  films  that  produces  increased  digi¬ 
tizer  noise.  This  can  be  seen  in  Fig  7,  which 


Fig 6. Images of bits per block
before segmentation (A) and after
segmentation (B) for an image
compressed at 40:1.

relates the number of bits per pixel to film
density  for  a  fixed  quantization  level,  for  reason¬ 
ably  flat  areas  of  an  image.  The  increasing  trend 
as  a  function  of  optical  density  is  caused  by  the 
increasing  noise  of  the  digitizer  as  film  density 
increases.  Also  shown  in  Fig  7  are  measure¬ 
ments  of  the  compressibility  of  skin  areas  as  a 
function  of  density.  These  points  are  clearly 
above  the  line.  We  have  observed  that  this  low 
compressibility  of  the  skin  areas  also  holds  for 
storage  phosphor  (computed  radiography)  im¬ 
ages  despite  the  very  different  noise  characteris¬ 
tics  of  these  images. 

Future  updates  to  the  JPEG  standard  are 
likely  to  contain  provisions  for  adaptive  quanti¬ 
zation.  When  this  happens,  it  will  be  necessary 
to  address  the  issue  of  the  relative  number  of 
data  bits  that  should  be  allocated  to  represent 
each particular type of tissue.

Effect of filtration. The effect of applying
the  nonlinear  filter  to  mammogram  number  6  is 
shown  in  Fig  8.  Although  the  filter  only  changed 
6.7%  of  the  pixel  values,  it  seemed  to  provide  a 
net  benefit  at  the  higher  compression  ratios.  Its 
behavior  was  comparable  with  what  was  achieved 
when  a  similar  filter  was  applied  to  the  compres¬ 
sion  of  chest  images.7  Figure  8  shows  the  RMSE 
for  the  compression  of  the  filtered  source  along 
with  the  RMSE  for  the  compression  of  the 
nonfiltered  source  as  they  depend  on  the  com¬ 
pression  ratio.  It  can  be  seen  that  at  low 
compression  ratios,  the  RMSE  of  the  filtered 
image  is  higher  because  of  changes  in  the  image 
caused  by  the  filter,  but  at  higher  compression 


Fig  7.  Plot  of  compressibility  of  different  "nonskin"  tissue 
regions  as  a  function  of  film  density  (line).  Also  included  for 
comparison  are  data  from  different  points  in  the  skin  region. 


Fig  8.  RMSE  resulting  from  nonlinear  filtration  as  a  function 
of  compression  ratio.  Results  for  both  the  filtered  and  nonfil- 
tered  image  are  shown. 

ratios,  the  errors  caused  by  compression  over¬ 
whelm  the  error  caused  by  filtration  and  be¬ 
come  predominant.  The  compression-induced 
errors  are  consistently  smaller  for  the  filtered 
image  than  for  the  nonfiltered  image,  and  at 
higher  compression  ratios,  the  filtered  image 
actually  deviates  less  from  the  source  image. 
This  behavior  persisted  across  our  image  set, 
but  was  found  to  be  image  dependent.  It  is  our 
view  at  this  time  that  filtration  of  this  sort  is 
advantageous  at  compression  ratios  above  20:1, 
but  it  remains  to  be  tested  in  diagnostic  perfor¬ 
mance  studies. 
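
The RMSE comparison described above can be sketched as follows. Following the text, both the filtered and the nonfiltered results are compared against the original source image, so the filtered curve includes the error introduced by the filter itself. The function name and the flat-sequence representation of the images are assumptions made for brevity.

```python
import math

def rmse(reference, test):
    """Root-mean-square error between two equal-length pixel sequences."""
    if len(reference) != len(test) or not reference:
        raise ValueError("sequences must be non-empty and of equal length")
    squared = sum((r - t) ** 2 for r, t in zip(reference, test))
    return math.sqrt(squared / len(reference))
```

In use, one would evaluate `rmse(source, decompressed_nonfiltered)` and `rmse(source, decompressed_filtered)` at each compression ratio and compare the two curves, as in Fig 8.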

Effect  of  segmentation  and  cropping.  In  gen¬ 
eral,  for  all  of  our  images,  preprocessing  the 
digitized  data  made  a  dramatic  improvement  in 
compressibility.  This  improvement  can  be  attrib¬ 
uted,  for  the  most  part,  to  the  benefit  of  segmen¬ 
tation.  It  is  actually  the  degree  of  compression 
of  the  image  area  corresponding  to  tissue  pixels, 
or  the  number  of  bits  per  tissue  pixel,  rather 
than  the  compression  ratio  for  the  entire  image 
array,  that  directly  determines  the  impact  of 
compression  on  the  diagnostic  quality  of  mam- 
mographic  images.  In  mammography,  there  is 
wide  variation  from  case  to  case  in  the  propor¬ 
tion  of  the  film  occupied  by  tissue  pixels.  In  all 
cases,  after  segmentation  and  cropping,  the 
background  will  be  compressed  at  a  very  high 
compression  ratio,  whereas  the  tissue  pixels  are 
compressed  at  a  more  modest  ratio. 

If  images  are  not  segmented,  then  cropping  is 
particularly  beneficial.  However,  if  the  images 
are  segmented  by  the  above  procedure,  the 
benefit of cropping is largely neutralized by the
efficiency of the segmentation process in increasing
the compressibility of the background. Nevertheless,
cropping can be useful, because when
the  image  is  reconstructed,  it  will  retain  its  size 
before  encoding.  Maintaining  excessive  back¬ 
ground  pixels  could  have  adverse  effects  on  the 
time  required  to  encode  and  decode  the  image 
as  well  as  on  other  system  components  such  as 
image  displays. 

Preservation  of  image  quality.  Much  of  a 
mammographic  image  does  not  contain  appre¬ 
ciable  high-frequency  information,  but  the  need 
to  preserve  microcalcifications  with  dimensions 
of less than 100 µm, which sometimes appear in
mammographic films, may require that we digitize
mammograms at 50 µm. This has the effect
of  oversampling  most  of  the  image — ie,  the 
actual  information  content  of  the  digitized  mam¬ 
mogram  is  much  less  than  the  size  of  the  image 
matrix  would  suggest. 

It  does  not  require  severe  quantization  to 
achieve  a  high  compression  ratio  on  areas  that 
do  not  contain  high-frequency  information.  The 
remainder  of  the  image,  such  as  areas  contain¬ 
ing  microcalcifications  or  the  skin,  is  not  as 
compressible,  but  these  areas  generally  com¬ 
prise  such  a  small  fraction  of  the  total  area  that 
they  have  minimal  impact  on  the  compression 
ratio.  Together,  these  factors  imply  that  it  may 
not  be  necessary  or  desirable  to  preferentially 
degrade  the  higher-frequency  components  in 
the  quantization  process  to  achieve  high  com¬ 
pression  ratios,  and  consequently,  it  is  possible 
to  preserve  features  such  as  microcalcifications 
during  the  compression  process. 

In  comparing  the  compressibility  of  mammo¬ 
grams  to  that  of  chest  images,  we  found  that 
preprocessed  mammograms  are  very  similar  to 
chest  images  in  compressibility;  however,  un¬ 
processed  mammograms  are  much  more  diffi¬ 
cult  to  compress  than  chest  images.  We  have 
included a plot in Fig 4 that was derived from a
previous study7 of the compression ratio of
chest images versus the quantization factor. The
chest image had been digitized at 100-µm resolution
with 12 bits of dynamic range. At each
level of quantization, the RMSE for the chest
image was similar to the RMSE for mammograms
compressed with the same quantization
factor.

Observer performance. When the differences
between images are small, it is possible that
readers can detect the difference, but still not
be able to tell which image is compressed. The
key to differentiating between images at the
lower compression ratios seemed to be the
presence or absence of blocking artifact. For the
compression techniques we applied in this study,
blocking artifact becomes noticeable before any
degradation of image resolution is obvious.
Blocking artifact could be detected by image-processing
specialists and nonspecialists alike,
but the nonspecialist readers were more likely
to misinterpret it. It is clear from discussions
with some readers upon completion of their
readings that when the blocking artifact was
barely visible they sometimes mistook it for
actual image information and assumed the image
with the artifact was the noncompressed
image. The two best readers seemed to be able
to avoid this mistake.

The JND study presented here is not intended
to address the question of the impact of
compression on diagnostic performance. Its purpose
was merely to give us an opportunity to test
the feasibility of the techniques we believe are
appropriate for the compression of mammographic
data, and to identify the range of
compression ratios that merit further study.
There are many possibilities for further refinements,
particularly in the area of preprocessing
the data, that are within the constraints of the
standard, and these will likely extend the useful
range of compression ratios.

REFERENCES

1. Joint Photographic Experts Group (JPEG): JPEG
Technical Specification, Revision 5 (Document No. JPEG-8-R5
or JTC1/SC2/WG8 N933). ISO Central Secretariat,
1990

2. Hudson GP, Yasuda H, Sebestyen I: The international
standardization of a still picture compression technique.
Proc IEEE Global Telecommunications Conference &
Exhibition, vol 2, November 28-December 1, 1988, pp
1016-1021

3. Wallace G, Vivian R, Poulsen H: Subjective testing
results for still picture compression algorithms for international
standardization. Proc IEEE Global Telecommunications
Conference & Exhibition, vol 2, November 28-December 1,
1988, pp 1022-1027

4. Daly S: Application of a noise adaptive contrast
sensitivity function to image data compression. Proc SPIE
1077:217-227, 1989

5. Rabbani M, Daly S: An optimized image data compression
technique utilized in the Kodak SV9600 still video
transceiver. Proc SPIE 1071:246-256, 1989

6. Ngan KN, Leong KS, Harcharan S: Cosine transform
coding incorporating human visual system model. Proc
SPIE 707:165-171, 1986

7. Good WF, Gur D: Quantization techniques for the
compression of chest images by JPEG type algorithms. Proc
SPIE 1652:114-121, 1992

8. Boncelet CG: Some uses for order statistic filtering in
image compression. Proc SPIE 1247:51-57, 1990

9. Ishigaki T, Sakuma S, Ikeda M, Itoh Y, Suzuki M, Iwai
S: Clinical evaluation of irreversible image compression:
Analysis of chest imaging with computed radiography.
Radiology 175:739-743, 1990

10. MacMahon H, Doi K, Sanada S, Montner SM, Giger
ML, Metz CE, Nakamori N, Yin F-F, Xu X-W, Yonekawa
H, Takeuchi H: Data compression: Effect on diagnostic
accuracy in digital chest radiography. Radiology 178:175-179,
1991


ADDENDUM 


Copies of microfilm referenced in this report can be obtained